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Abstract 

LRM-Trees are an elegant way to partition a sequence of values into sorted consecutive 
blocks, and to express the relative position of the first element of each block within a previous 
block. They were used to encode ordinal trees and to index integer arrays in order to support 
range minimum queries on them. We describe how they yield many other convenient results 
in a variety of areas, from data structures to algorithms: some compressed succinct indices 
for range minimum queries; a new adaptive sorting algorithm; and a compressed succinct data 
structure for permutations supporting direct and indirect application in time all the shortest 
as the permutation is compressible. As part of our review preliminary work, we also give an 
overview of the, sometimes redundant, terminology relative to succinct data-structures and 
indices. 



1 Introduction 

Introduced by Fischer [8] as an indexing data structure which supports range minimum queries 
(RMQ) in constant time and zero access to the main data, and by Sadakane and Navarro [26) 
to support navigation operators on ordinal trees, Left-to-Right-Minima Trees (LRM-Trees) are an 
elegant way to partition a sequence of values into sorted consecutive blocks, and to express the 
relative position of the first element of each block within a previous block. 

We describe in this extended abstract how the use of LRM-Trees and variants yields many other 
convenient results in a variety of areas, from data structures to algorithms: 

1. We define several compressed succinct indices supporting Range Minimum Queries (RMQs), 
which use less space than the 2n + o(n) bits used by the succinct index proposed by Fischer [8] 
when the indexed array is partially sorted. Note that although a space of 2n bits is optimal 
in the worst case over all possible permutations of size n, this is not necessarily optimal on 
more restricted classes of permutations. For example, if A = [1, 2, . . . , n], it is possible to 
support RMQs on A without any additional space. Although there is a RMQ succinct index 
that exploits the compressibility of A [9j, it only takes advantage of repetitions in the input 
and would still use 2n + o(n) bits for the example above. 
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2. 



We propose a new sorting algorithm and its adaptive analysis, asymptotically superior to 
adaptive merge sort [2], and superior in practice to Levcopoulos and Petersson's sorting 
algorithms 18 . 



3. We design a compressed succinct data structure for permutations, which uses less space than 
the previous compressed succinct data structure from Barbay and Navarro (2j, and supports 
the access operator and its inverse in time all the shortest as the permutation is compressible, 
and range minimum queries and previous smaller value queries in constant time. 

All our results are in the word RAM model, where it is assumed that we can do arithmetic and 
logical operations on tu-bit wide words in 0(1) time, and w = Q(lgn). The following section gives 
examples of results that have been obtained in this natural model; we start by giving an overview 
of the, sometimes redundant, concepts on succinct data structures and succinct indices. 

2 Previous Work and Concepts 

2.1 On the Various Types of Succinct Data Structures 

Some concepts (e.g., succinct indices and systematic data structures) on succinct data structures 
were invented more than once, at similar times but with distinct names, which makes their classi- 
fication more complicated than necessary. Given that our results cross several areas (namely, com- 
pressed succinct data structures for permutations and indices supporting range minimum queries), 
which each use distinct names, we aim in this section to clarify the potential overlaps of concepts, 
to the extent of our knowledge. 

A Data Structure T> (e.g., run encoding of permutations (2]) specifies how to encode data from 
some Data Type T (e.g., permutations) so that to support the operators specified by a given 
Abstract Data Type A (e.g., direct and inverse applications). Naturally, a data structure usually 
requires more space than a simple encoding scheme of the same data-type, given that it supports 
operators in addition to just memorize the data: the amount of additional space required is called 
the redundancy of the data structure. 

A Succinct Data Structure [15] is a data structure whose redundancy is asymptotically negligible 
as compared to the space required to encode the data itself, in the worst or uniform average case 
over all instances of fixed size n (e.g., a succinct data structure for bit vectors using n + o(n) bits). 
An Ultra- Succinct Data Structure [l6j is a compressed data-structure (w.r.t. a parameter measuring 
the compressibility of the data) whose redundancy is asymptotically negligible as compared to the 
space required to encode the data in the worst case over all instances for which the size n is fixed 
(e.g., an ultra-succinct data-structure for binary strings [23] uses nH$ + o(n) bits, where Hq is 
the entropy (information content) of the string). A Compressed Succinct Data Structure [2] is a 
compressed data structure whose redundancy is asymptotically negligible as compared to the space 
required to compress the data itself in the worst or average case over all instances for which the size 
n is fixed (e.g., a compressed succinct data structure for binary strings uses uHq + o(uHq) bits). 

An Index is a structure which, given access to some data structure T> supporting a defined 
abstract data type A (e.g., a data structure for strings supporting the access operator), extends 
the set of operators supported in good time to a more general abstract data type A' (e.g., rank 
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and select operators on strings) [j] By analogy with succinct data structures, the space used by an 
index is called redundancy. A Succinct Index n] or Systematic Data Structure (ill I is simply 
an index whose redundancy is negligible in comparison to the space required by T> in the worst 
case over instances of fixed size n. The separation between a data structure and its index was 



implicitly used before its formalization 25 and explicitly to prove lower bounds on the trade-off 
between space and supporting time of succinct data structures [12] . Of course, if V is a succinct 
data structure, then the data structure formed by the union of V and X is a succinct data structure 
as well: this modularity permits the combination of succinct indices for distinct abstract data 
types on similar data types [l]. A Compressed Succinct Index is an index whose redundancy is 
negligible in comparison to the space required by T> in the worst case over instances of fixed size 
n, as well as decreasing with a given measure of compressibility of the index (e.g. the short-cut 



data-structure 20 supporting ir 1 () uses space inversely proportional to the length of cycles in the 
permutation ir). 

The terms of integrated encoding [T], self-index 19 , non- systematic data structure [8j[ll] or 
encoding data structure [4] refer to a data structure which does not require access to any other data 
structure than itself, as opposed to a succinct index. In the case of integrated encodings [T] and 
self-indices |19| , there is no need for any other data structure, as they re-code all information and 
hence provide their own mechanism for accessing the data. Those data structures are considered 
less practical from the point of view of modularity, but this approach has the advantage of yielding 
potentially lower redundancies: Golynski [12] showed that if a bit vector B is stored verbatim using 
n bits, then every index supporting the operators access, rank, and select must have redundancy 



^nig]gn) bitg) while patrascu 22 gave an integrated encoding for B with redundancy 0{ 
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bits. In the case of non-systematic data structures [8.11 and encoding data structures [4[, the 
emphasis is that those indexing data structures require much less space than the data they index, 
and being able to answer some queries (other than access, obviously) without any access to the 
main data. Of course, such an index can be seen as a data structure itself, for a distinct data type 
(e.g., a Lowest Common Ancestor non-systematic succinct index of 2n + o(n) bits for labeled trees 
is also a simple data structure for ordinal trees): those notions are relative to their context. 

Following the model of Daskalakis et al.'s analysis U\ of sorting algorithms for partial orders, 
we distinguish the data complexity and the index complexity of both algorithms and succinct in- 
dices, measuring separately the number of operations it performs on the data and on the index, 
respectively. Following these definitions, a non-systematic data structure is a succinct index of 
data complexity equal to zero, and the usual complexity of a succinct index is the sum of its data 
complexity with its index complexity. This distinction is important for instance when we consider 
a semi-external memory model, where it could occur that the data structure is too large to reside 
in main memory and is therefore kept in external memory (which is expensive to access), but its 
index is small enough to be stored in RAM. In such a case it is preferable to use a succinct index 
of minimal data complexity. 



1 The fundamental rank and select operators on a bit-vector B are defined as follows: rank\ (B, i) gives the number 
of l's in the prefix B[l, i], and selech(B, i) gives the position of the i-th 1 in B, reading B from left to right (1 < i < n). 
Operations ranko(B,i) and selecto(B,i) are defined analogously for 0-bits. 
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Figure 1: The LRM-Tree 7a of the input array A, and its DFUDS U. 
2.2 Left-to-Right-Minima Trees 

LRM- Trees are an elegant way to partition a sequence of values into sorted consecutive blocks, and 
to express the relative position of the first element of each block within a previous block. They 
were introduced under this name as an internal tool for basic navigational operations in ordinal 
trees |26| and, under the name of "2d-Min Heaps," to index integer arrays in order to support range 
minimum queries on them [8j. 

Let A[l,n] be an integer array. For technical reasons, we define A[0] = — oo as the "artificial" 
overall minimum of the array. 

Definition 1 (Fischer [8j; Sadakane and Navarro [26]). For 1 < i < n, let psv A(i) = max{j £ 
[0..i — 1] : A[j] < A[i]} denote the previous smaller value of position i. The Left-to-Right- 
Minima Tree (LRM-Tree) Ta of A is an ordered labeled tree with vertices 0, . . . ,n. For 1 < i < n, 
PSV A(i) is the parent node of i. The children are ordered in increasing order from left to right. 

See Fig. [T] for an example of LRM- Trees. The following lemma shows a simple way to construct 
the LRM-Tree in linear time (Fischer [8] gave a more complicated linear-time algorithm with 
advantages that are irrelevant for this paper.) 

Lemma 2. There is an algorithm computing the LRM-Tree of an array of n integers in at most 
2n data comparisons. 

Proof. The computation of the LRM-Tree corresponds to a simple scan over the input array, 
starting at A[0] = — oo, building down iteratively the current rightmost branch of the tree with 
increasing elements of the sequence till an element x smaller than its predecessor is encountered, at 
which point one climbs the right-most branch up to the first node v holding a value smaller than x, 
and starts a new branch with a right-most child of v of value x. As the root of the tree has value 
A[0] = — oo smaller than all elements, the algorithm always terminates. 

The construction algorithm performs at most 2n comparisons. Charging the last comparison 
performed during the insertion of an element x to x itself, and all previous comparisons to the 
elements already in the LRM-Tree, each element is charged at most twice: once when it is inserted 
into the tree, and once when scanning it while searching for a smaller value on the rightmost branch. 
As in the latter case all scanned elements are removed from the rightmost path, this second charging 
occurs at most once for each element. ■ 
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2.3 Range Minimum Queries 



We consider the following queries on a static array A[l, n] (parameters i and j with 1 < i < j < n): 
Definition 3 (Range Minimum Queries). RMQ^(i,j) = position of the minimum in A[i,j]. 

RMQs have a wide range of applications for various data structures and algorithms, including 
text indexing 10 , pattern matching [6], and more elaborate kinds of range queries [jjj. 



The connection between LRM- Trees and RMQs is given as follows. For two given nodes i and j 
in a tree T, let LCAt(z, j) denote their Lowest Common Ancestor (LCA), which is the deepest node 
that is an ancestor of both i and j. Now let Ta be the LRM- Tree of A. For arbitrary nodes i and 
j in Ta, 1 < i < j < n, let £ = hCAj- A (i, j). Then if £ = i, RMQ A (i,j) is given by i, and otherwise, 
RMQ j 4(f, j) is given by the child of t that is on the path from I to j [8]. 

Since there are succinct data structures supporting the LCA operatoiQ in succinctly encoded 
trees in constant time, this yields a succinct index: 

Lemma 4 (Fischer [§]). For an array A[l,n] of totally ordered objects, there is a non- systematic 
succinct index using 2n + o(n) bits and supporting RMQs in zero data queries and 0(1) index 
queries. This index can be built using at most 0(n) data comparisons. 



2.4 Adaptive Sorting, and Compression of Permutations 

Sorting a permutation of n elements in the comparison model typically requires J7(nlgn) com- 
parisons in the worst case. Yet, better results can be achieved for some parameterized classes of 
permutations. Among others, Knuth yL7\ considered Runs (ascending subsequences), counted by 
nRuns(7r) = 1 + |{i : 1 < % < n, iTi+i < 7Tj}|; Levcopoulos and Petersson [18] introduced Shuffled Up 
Sequences, counted by nSUS(7r) = min{fc : tt is covered by k increasing subsequences}, and Shuffled 
Monotone Sequences, counted by nSMS(7r) = min{/c : tt is covered by k monotone subsequences}; 
and Barbay and Navarro [2j introduced strict variants of those concepts, namely Strict Runs and 
Strict Shuffled Up Sequences, where sorted subsequence are composed of consecutive integers (e.g. 
(2,3,4,1,5,5, 7,8) has two runs but three strict runs), counted by nSRuns and nSSUS, respec- 
tively. For any "measure of disorder" X among those five, there is a variant of the merge-sort 
algorithm which sorts a permutation tt of size n and measure X in time 0((n + 1) lgX), which is op- 
timal in the worst case among instances of fixed size n and fixed values of X (this is not necessarily 
true for other measures of disorder). 

As the merging cost induced by a subsequence is increasing with its length, the sorting time of 
a permutation can be improved by rebalancing the merging tree pi. This merging cost is actually 
equivalent to the cost of encoding, for each element of the sorted permutation, the subsequence 
of origin of this element. Hence rebalancing the merging tree is equivalent to optimize a code for 
those addresses, and can be done via a Huffman tree [l4]. The complexity can then be expressed 
more precisely in function of the entropy of the relative sizes of the sorted subsequences identified, 
where the entropy H(Seq) of a sequence Seq = {n\,U2, ■ ■ ■ ,n r ) of r positive integers adding up to 
n is 'H(Seq) = Y7i=i 7fl§^ which satisfies (r — l)lgn < n%(Seq) < nlgr (by concavity of the 
logarithm) . 

2 The inherent connection between RMQs and LCAs has been exploited also in the other direction [3]. 
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Barbay and Navarro [2] observed that each such algorithm from the comparison model also 
describes an encoding of the permutation tt that it sorts, so that it can be used to compress 
permutations from specific classes to less than the information-theoretic lower bound of n lg n bits. 
Furthermore they used the similarity of the execution of the merge-sort algorithm with a Wavelet 
Tree jl3|, to support the application of 7r() and its inverse ir~ 1 () in time logarithmic in the disorder 
of the permutation it as measured by nRuns, nSRuns, nSUS, nSSUS and nSMS, respectively) in the 
worst case. We summarize their technique in Lemma [5] below, in a way independent of the partition 
chosen for the permutation. 

Lemma 5 (Barbay et al. [2]). Given a partition Seq of a permutation tt of n elements into nSeq 
sorted subsequences of respective lengths Seq ; these subsequences can be merged with n(l + %(Seq)) 
comparisons on tt and 0(nSeq lg nSeq) internal operations, and this merging can be encoded using 
at most (1 + %(Seq))(n + o(n)) + O(nSeqlgn) bits so that it supports the computation of n(i) and 
7r _1 (i) in time 0(1 + IgnSeq) in the worst case and in time 0(1 + %(Seq)) on average when i is 
chosen uniformly at random in [l..n]. 



3 Compressed Succinct Indexes for Range Minima 

We now explain how to improve on the result from Lemma [4] for permutations that are partially 
ordered. Without loss of generality, we consider only the case where the input is a permutation of 
[l..n]: if this is not the case, we can sort the elements in A by rank, considering earlier occurrences 
of equal elements as smaller. 

3.1 Strict Runs 

The simplest compressed data structure for RMQs uses an amount of space which is a function 
of nSRuns, the number of strict runs in tt. It uses 2nSRuns + o(n) bits on permutations where 
nSRuns G o(n): 

Theorem 1. There is a non-systematic compressed succinct index using 2nSRuns+ [lg ( n sR uns )l + 
o(n) bits and supporting RMQs in zero data queries and 0(1) index queries. 

Proof. We mark the beginnings of each runs in A with a 1 in a bit- vector i?[l,n], and represent 
B with the compressed succinct data structure from Raman et al. [23], using [lg ( nS R uns )l + o(n) 
bits. Further, we define A' as the (conceptual) array consisting of the heads of ^4's runs (A'[i] = 
A[selecti(B,i)}). We build the LRM-Tree from Lemma|4]on A'; using 2nSRuns(l + o(l)) bits. To 
answer a query RMQ^(i,j), compute x = rank±(B,i) and y = rank\(B, j), and compute m' = 
RMQ j 4/(x, y) as a range minimum in A' , and map it back to its position in A by m = selecti(B, to'). 
Then if to < i, return i as the final answer to RMQ^(i, j), otherwise return m. The correctness 
from this algorithm follows from the fact that only i and the heads of strict runs that are entirely 
contained in the query interval can be the range minimum; the former occurs if and only if the 
head of the run containing i is smaller than all other heads in the query range. ■ 

Obviously, this compressed data-structure is interesting only if nSRuns G o(n). We explore in the 
following section a more general measure of partial order, nRuns. 
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3.2 General Runs 



The same idea as in Theorem [T] applied to more general runs yields another compressed succinct 
index for RMQs, potentially smaller but this time requiring to access the input to answer RMQs. 

Theorem 2. There is a systematic compressed succinct index using 2nRuns + [~lg ( n Ru ns )] + o{n) 
bits and supporting RMQs in 1 data comparison and 0(1) index operations. 

Proof. We build the same data structures as in TheoremjTJ now using 2nRuns+ [~lg ( nR ™ ns )l +o(n) 
bits. To answer a query rmq^^j), compute x = rank\\B,i) and y = ranki(B,j). If x = y, 
return %. Otherwise, compute m! = rmq^/(x + l,y), and map it back to its position in A by 
m = selecti(B,m'). The final answer is k if A[k] < A[m], and m otherwise. ■ 

To achieve a non-systematic compressed succinct index whose space usage is a function of nRuns, 
we need more space and a more heavy machinery, as shown next. The main idea is that a permu- 
tation with few runs results in a compressible LRM-Tree, where many nodes have out-degree 1. 

Theorem 3. There is a non- systematic compressed succinct index using 2nRunslgra + o(n) bits, 
and supporting RMQs in zero data comparisons and 0(1) index operations. 



Proof. We build the LRM-Tree Ta from Sect. 2.2 directly on A, and then compress it with the 
tree-compressor due to Jansson et al. [16] . 

To see that this results in the claimed space, let n/% denote the number of nodes in Ta with out- 
degree k > 0. Let . . . , (i n Runsi jnRuns]) be an encoding of the runs in A as (start, end), and 
look at a pair (i x ,jx)- We have PSV A(k) = k — 1 for all k S [i x + l-.j x ], and so the nodes in [ias.-Jx] 
form a path in Ta, possibly interrupted by branches stemming from heads i y of other runs y > x 
with PSVA{iy) £ [ix--jx — 1]- Hence uq = nRuns, and n\ > n — nRuns — (nRuns — 1) > n — 2nRuns, 
as in the worst case the values PSV/^iy) for i y E {12,^3, • • • , ^nRuns} are all different. 

Now Ta, with degree-distribution no, . . . ,n n _i, is compressed into uH*(Ta) + O ( n ^ n n ^ bits, 
where 

nH*(T A ) = lgf 1 f 

\n \n ni . ..n n -i 

is the so-called tree entropy [16] of Ta- This representation supports all navigational operations in 
Ta in constant time, and in particular those required for Lemma [4] A rough inequality yields a 
bound on the number of possible LRM- Trees: 

n \ n! n! n! 9 „D linc 

< — < < n 2nRuns , 



K nQni . . . n n _iy no!ni! . . . n n _i! n\\ (n — 2nRuns)! 
from which one easily bounds the space usage of the compressed succinct index: 

1 2nRuns \ _ i„ / „2nRuns-l 



nH*{T) < lg I -n ^ nKuns j = lg ^ nKuns - i J = (2nRuns - 1) lgn < 2nRunslgn . 
Adding the space required to index the structure of Jansson et al. [16] yields the desired space. 
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4 Sorting Permutations 



Barbay and Navarro [2] showed how to use the decomposition of a permutation tt in nRuns ascending 
consecutive runs of respective lengths Runs to sort adaptively to their entropy %(Runs). Those runs 
entirely partition the LRM-Tree of ir into nRuns paths, each starting at some branching node of 
the tree, and ending at a leaf: one can easily draw this partition by iteratively tagging the leftmost 
maximal untagged up-from-leaf path of the LRM-Tree. 

Yet, any partition of the LRM-Tree into down paths (so that the values traversed by the path are 
increasing) can be used to sort tt. Since there are exactly nRuns leaves in the LRM-Tree, no such 
partition can be smaller than the partition of tt into ascending consecutive runs. But in the case 
where some of those partitions are more imbalanced than the original one, this yields a partition 
of smaller entropy, and hence a faster sorting algorithm. We define a family of such partitions: 

Definition 6 (LRM-Partition). A LRM-Partition of a permutation tt with LRM-Tree T n is defined 
recursively as follows. One subsequence is the "spinal chord" ofT n , one of the longest root-to-leaf 
paths in %r • Removing this spinal chord of T-k leaves a forest of more shallow trees. The rest of the 
partition is obtained by computing and concatenating some LRM-partitions of those trees. 

This definition does not define a unique partition, but a family of partitions: there might be 
several ways to choose the "spinal chord" of each subtree when several nodes have the same depth, 
and of course the order of the subsequences in the partition does not matter either. Yet, there 
will always be nRuns many subsequences in the partition, and any LRM-Partition is never worse 
and often better (in terms of sorting and compressing) than the the original Run-Partition. The 
situation is similar to the one of %(SUS) versus nSUS: it is easier to minimize nSUS (resp. nRuns) 
than %(SUS) (resp. H(LRM)), yet one can take advantage of the entropy of a partition minimizing 
nSUS (resp. of a LRM-Partition). 

Note that each down-path of the LRM-Tree corresponds to an ascending subsequence of tt, but 
not all ascending subsequences correspond to down-paths of the LRM-Tree, hence partitioning 
optimally tt into nSUS ascending subsequences potentially yields smaller partitions, or ones of 
smaller entropy: the LRM-partitions seem inferior to SUS-partitions. Yet, the fact which make 
LRM-Partitions particularly interesting is that it can be computed in linear time (which is not true 
for SUS-Partitions): 

Lemma 7. There is an algorithm finding one of the LRM-Partitions of a permutation tt of size n 
in 0{n) data comparisons. 

Proof. Definition [6] is constructive: we are only left to show that this algorithm can be executed 
in linear time. Having built Ta using Lemma [2] in 2n comparisons, we first set up an array D 
containing the depths of the nodes in Ta, listed in preorder. We then index D for range maximum 
queries in linear time using Lemma [4} 

Now the deepest node in Ta can be found by a range maximum query over the whole array, 
supported in constant time. From this node, we follow the path to the root, and save the corre- 
sponding nodes as the first subsequence. This divides A into disconnected subsequences, which 
can be processed recursively using the same algorithm, as the nodes in any sub-tree of Ta form an 
interval in D. We do so until all elements in A have been assigned to a subsequence. 
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Note that in the recursive steps, the numbers in D are not anymore the depths of the corre- 
sponding nodes in the remaining sub-trees. But as all depths listed in D differ by the same offset 
from their depths in any connected subtree, this does not affect the result of the range maximum 
query. ■ 

Given a LRM-Partition of the permutation ir, sorting it is just a matter of applying Lemma [5] 

Theorem 4. Let it be a permutation of size n. Identifying its nRuns runs by building the LRM-Tree 
through Lemma obtaining a LRM-Partition of subsequences of respective lengths LRM through 
Lemma [?| and merging the subsequences of this partition through Lemma [5| results in an algo- 
rithm sorting it in a total o/n(3 + %(LRM)) data comparisons and 0{n + nRuns lg nRuns) internal 
operations, accounting for a total time of 0{n{\ + "H(LRM))). 

Proof. Lemma [2] builds the LRM-Tree in 2n data comparisons, Lemma [7] extract from it a LRM- 
Partition in 0(n) internal operations, and Lemma [5] merges the subsequences of the LRM-Partition 
in n(l + %(LRM)) data comparisons and 0(nRunslg nRuns) internal operations. The sum of those 
complexities yields ra(3+%(LRM)) data comparisons and C(n-(-nRunslg nRuns) internal operations. 

Since nRuns lg nRuns < n%(LRM) + lg nRuns by concavity of the logarithm, the total time com- 
plexity is in 0(n(l + H(LRM))). U 

Since by construction %(LRM) < %(Runs), this result naturally improves on the adaptive merge 
sort algorithm for runs |2j. However, ft (SUS) can be arbitrarily smaller than ft (LRM): this means 
that, in the worst case over instances of fixed n and ft(SUS), SUS sorting has a strictly better 
asymptotical complexity than LRM sorting; while, in the worst case over instances of fixed n and 
ft(LRM), SUS sorting has the same asymptotical complexity than LRM sorting. 

Yet, on instances where ft(LRM) < 2ft(SSUS) — 1, LRM-Sorting actually performs less data 
comparisons (and potentially more index operations) than SUS-Sorting. Barbay et al.'s improve- 
ment |2| of SUS-Sorting performs 2n(l+ft(SUS) data comparisons, decomposed into n(l+ft(SUS)) 
data comparisons to compute a partition it into nSUS sub-sequences which is minimal in size, if not 
necessarily in entropy; and n(l + ft(SUS)) data comparisons (and 0{n + nSUS lg nSUS) internal op- 
erations) to merge the subsequences into a single ordered one. On the other hand, the combination 
of Lemma [2] with Lemma [7] yields a LRM-Partition in 2n data comparisons and 0(n) index opera- 
tions; which is then merged in n(l+ft(LRM)) data comparisons (and 0{n+ nRuns lg nRuns) internal 
operations) to merge the subsequences into a single ordered one. Comparing the 2n(l + ft(SUS) 
data comparisons of SUS-Sorting with the n(3 + ft(LRM)) data comparisons of LRM-Sorting shows 
that on instance where ft(LRM) < 2ft(SUS) — 1, LRM-Sorting performs less data comparisons 
(only potentially twice less, given that ft(SUS) < ft(LRM). This comes to the price of potentially 
more internal operations: SUS-Sorting performs 0(n + nSUSlg nSUS) such ones while LRM-Sorting 
performs 0{n + nRuns lg nRuns) such ones, and nSUS < nRuns by definition. 

When considering external memory, this is important in the case where the data does not fit in 
main memory while the internal data-structures (using much less space than the data itself) of the 
algorithms do: then data comparisons are much more costly than internal operations. Furthermore, 
we show in the next section that this difference of performance implies an even more meaningful 
difference in the size of the permutation encodings corresponding to the sorting algorithms. 
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5 Compressing Permutations 



As shown by Barbay and Navarro [2j, sorting opportunistically in the comparison model yields a 
compression scheme for permutations, and sometimes a compressed succinct data structure sup- 
porting the direct and inverse operators in reasonable time. We show that this time again the 
sorting algorithm of Theorem [4] corresponds to a compressed succinct data structure for permu- 
tations which supports the direct and reverse operators in good time, while often using less space 
than previous solutions. The essential component of our solution is a data structure encoding the 
LRM-Partition. In order to apply Lemma [5j our data structure must support two operators in 
good time: 

• the first operator, map(i), consists of indicating, for each position i E [l..n] in the input per- 
mutation 7r, the corresponding subsequence s of the LRM-Partition, and the relative position 
p of i in this subsequence; 

• the second operator, unmap(s,p) is just the reverse of the previous one: given a subsequence 
s € [L.nRuns] of the LRM-Partition of it and a position p 6 [l..ra s ] in it, the operator must 
indicate the corresponding position i in ir. 

We obviously cannot afford to rewrite the numbers of ir in the order described by the partition, 
which would use nlgn bits. A naive solution would be to encode this partition as a string S 
over alphabet [L.nRuns], using a succinct data structure supporting the access, rank and select 
operators on it. This solution is not suitable as it would require at the very least n^(Runs) bits 
only to encode the LRM-Partition, making this encoding worse than the nRuns compressed succinct 
data structure |2|. We describe a more complex data structure which uses linear space, and supports 
the desired operators in constant time. 

Lemma 8. Let P be a LRM-Partition consisting of nRuns subsequences of respective lengths LRM, 
summing to n. There is a succinct data structure using 2(n + nRuns) + o(n) bits and supporting 
the operators map and unmap on P in constant time. 

Proof. The main idea of the data structure is that the subsequences of a LRM-Partition for a 
permutation ir are not as general as, say, the subsequences of the partition into nSUS up-sequences. 
For each pair of subsequences (u,v), either the positions of u and v belongs to distinct intervals of 
7r, or the values corresponding to u (resp. v) all fall between two values from v (resp. u). 

As such, the subsequences of the LRM-Partition can be organized into a forest of ordinal trees, 
where the internal nodes of the trees correspond to the nRuns subsequences of the LRM-Partition, 
organized so that u is parent of v if the positions of v are contained between two positions of 
u, and where the leaves of the trees correspond to the n positions in tt, children of the in- 
ternal node u corresponding to the subsequence they belong to. For instance, the permutation 
tt = (4, 5, 9, 6, 8, 1, 3, 7, 2) has a unique LRM-Partition {(4, 5, 6, 8), (9), (1, 3, 7), (2)}, whose encod- 
ing can be visualized by the expression (45(9)68) (137) (2) and encoded by the balanced parenthesis 
expression (()()(())()())(()()())(()) (note that this is a forest, not a tree, hence the excess of '('s 
versus ')'s is going to zero several times inside the expression). 

Given a position i S [l..n] in ir, the corresponding subsequence s of the LRM-Partition is simply 
obtained by finding the parent of the i-th leaf, and returning its preorder rank among internal 
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nodes. The relative position p of i in this subsequence is given by the number of its left siblings 
which are leaves. Given a subsequence s 6 [L.nRuns] of the LRM-Partition of tt and a position 
p 6 [l..n s ] in it, the corresponding position i in tt is computed by finding the s-th internal node 
in preorder, selecting its p-th child which is a leaf, and computing the preorder rank of this node 
among all the leaves of the tree. 

We represent such a forest using a Balanced Parentheses Sequence using 2(n + nRuns) + o{n) 
bits and enhance it with a o(n)-bit succinct index [24] supporting in constant time the operators 
rank and select on leaves (i.e., on the pattern '()'), and rank and select on internal nodes (i.e., on 
the pattern '(('). With these operators we can simulate all operations described in the previous 
paragraph. ■ 

Given the data structure for LRM-Partitions from Lemma[8j applying the merging data structure 
from Lemma [5] immediately yields a compressed succinct data structure for permutations. Note 
that this encoding is not a succinct index, so that it would not make any sense to measure its space 
complexity in term of data and index complexity. 

Theorem 5. Let tt be a permutation of size n and P a LRM-Partition for tt consisting of nRuns 
subsequences of respective lengths LRM. There is a compressed succinct data structure using (1 + 
%(LRM))(n + o(n)) + O(nRunslgn) bits, supporting the computation of tt{i) and vr _1 (i) in time 
0(l+lg nRuns) in the worst case, and in time 0(1+%(LRM)) on average when i is chosen uniformly 
at random in [l..n], and which can be computed in the times indicated in Theorem^ summing to 
0(n(l + «(LRM))). 

Proof. Lemma [8] yields a data structure for a LRM-Partition of tt using 2(n + nRuns) + o(n) bits, 
and supports the map and unmap operators in constant time. The merging data structure from 
Lemma [5] requires (1 + %(LRM))(n + o(n)) + O(nRunslgn) bits, and supports the operators ttQ 
and 7r () in the time described, through the additional calls to map and unmap. Summing both 
spaces yields the desired final space. ■ 



6 Conclusion and Future Work 



One additional result not described here is how to take advantage of strict runs, in addition of 
taking advantage of general runs, for LRM sorting and encoding of permutation. Another related 
result is a variant of LRM- Trees, Roller Coaster Trees (RC-Trees), which take advantage of per- 
mutations formed by the combinations of ascending and descending runs. This approach is trivial 
when considering subsequences of consecutive positions, gets slightly technical when considering the 
insertion of descending runs, and requires new techniques to adapt the compressed succinct data 
structure to this new setting. Since the optimal partitioning into up and down sequences when 
considering general subsequences requires exponential time, RC-Sorting seems a much desirable 
improvement on merging ascending and descending runs, as well as a more practical alternative to 
SMS-Sorting, in the same way as LRM- Tree improved on Runs-Sorting while staying more practical 
than SUS-Sorting. Another result to come is the generalization of our results to the indexing, sort- 
ing and compression of general sequences (i.e., also to integer functions), taking advantage of the 
redundancy in a general sequence to sort faster and encode in even less space, in function of both 
the entropy of the frequencies of the symbols and the entropy of the lengths of the subsequences 
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of the LRM-Partition. Finally, studying the integration of those compressed data structures into 
compressed text indexes like suffix arrays |21| is likely to yield interesting results, too. 
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